NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Bayesian Multilevel Latent Class Profile Analysis: Inference and Estimation for Exploring the Diverse Pathways to Academic Proficiency

https://doi.org/10.1080/00273171.2025.2501341

Lee, JungWun; McCoach, D Betsy; Harel, Ofer; Chung, Hwan (May 2025, Multivariate Behavioral Research)

Free, publicly-accessible full text available May 22, 2026
Teaching Statistical Concepts Using Computing Tools: A Review of the Literature

https://doi.org/10.1080/26939169.2024.2445541

Zavez, Katherine; Harel, Ofer (March 2025, Journal of Statistics and Data Science Education)

Free, publicly-accessible full text available March 31, 2026
Classification of Bovidae fossils from Gladysvale, South Africa using elastic shape analysis

https://doi.org/10.1016/j.jas.2024.105959

Brophy, Juliet K; Matthews, Gregory J; Schnitzler, Nicole; Bharath, Karthik; Kurtek, Sebastian; Harel, Ofer (June 2024, Journal of Archaeological Science)

Full Text Available
A latent class selection model for categorical response variables with nonignorably missing data

https://doi.org/10.4310/22-SII753

Lee, Jung Wun; Harel, Ofer (January 2024, Statistics and Its Interface)

Full Text Available
A Two-Stage Classification for Dealing with Unseen Clusters in the Testing Data

https://doi.org/10.6339/24-JDS1140

Lee, Jung Wun; Harel, Ofer (January 2024, Journal of Data Science)

Classification is an important statistical tool that has increased its importance since the emergence of the data science revolution. However, a training data set that does not capture all underlying population subgroups (or clusters) will result in biased estimates or misclassification. In this paper, we introduce a statistical and computational solution to a possible bias in classification when implemented on estimated population clusters. An unseen-cluster problem denotes the case in which the training data does not contain all underlying clusters in the population. Such a scenario may occur due to various reasons, such as sampling errors, selection bias, or emerging and disappearing population clusters. Once an unseen-cluster problem occurs, a testing observation will be misclassified because a classification rule based on the sample cannot capture a cluster not observed in the training data (sample). To overcome such issues, we suggest a two-stage classification method to ameliorate the unseen-cluster problem in classification. We suggest a test to identify the unseen-cluster problem and demonstrate the performance of the two-stage tailored classifier using simulations and a public data example.
more » « less
Full Text Available
Shape-Based Classification of Partially Observed Curves, With Applications to Anthropology

https://doi.org/10.3389/fams.2021.759622

Matthews, Gregory J.; Bharath, Karthik; Kurtek, Sebastian; Brophy, Juliet K.; Thiruvathukal, George K.; Harel, Ofer (October 2021, Frontiers in Applied Mathematics and Statistics)

We consider the problem of classifying curves when they are observed only partially on their parameter domains. We propose computational methods for (i) completion of partially observed curves; (ii) assessment of completion variability through a nonparametric multiple imputation procedure; (iii) development of nearest neighbor classifiers compatible with the completion techniques. Our contributions are founded on exploiting the geometric notion of shape of a curve, defined as those aspects of a curve that remain unchanged under translations, rotations and reparameterizations. Explicit incorporation of shape information into the computational methods plays the dual role of limiting the set of all possible completions of a curve to those with similar shape while simultaneously enabling more efficient use of training data in the classifier through shape-informed neighborhoods. Our methods are then used for taxonomic classification of partially observed curves arising from images of fossilized Bovidae teeth, obtained from a novel anthropological application concerning paleoenvironmental reconstruction.
more » « less
Full Text Available

Search for: All records